Neural Net Model for Featured Word Extraction

نویسندگان

  • Atin Das
  • Matus Marko
  • A. Probst
  • M. A. Porter
  • Carlos Gershenson
چکیده

Search engines perform the task of retrieving information related to the user-supplied query words. This task has two parts; one is finding ’featured words’ which describe an article best and the other is finding a match among these words to user-defined search terms. There are two main independent approaches to achieve this task. The first one, using the concepts of semantics, has been implemented partially. For more details see another paper of Marko et al., 2002. The second approach is reported in this paper. It is a theoretical model based on using Neural Network (NN). Instead of using keywords or reading from the first few lines from papers/articles, the present model gives emphasis on extracting ’featured words’ from an article. Obviously we propose to exclude prepositions, articles and so on, that is , English words like "of, the, are, so, therefore, " etc. from such a list. A neural model is taken with its nodes pre-assigned energies. Whenever a match is found with featured words and userdefined search words, the node is fired and jumps to a higher energy. This firing continues until the model attains a steady energy level and total energy is now calculated. Clearly, higher match will generate higher energy; so on the basis of total energy, a ranking is done to the article indicating degree of relevance to the user’s interest. Another important feature of the proposed model is incorporating a semantic module to refine the search words; like finding association among search words, etc. In this manner, information retrieval can be improved markedly.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS

Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...

متن کامل

Vector Quantizer Signal Transform

This paper deals with the problem of combination of Neural Networks (NN) and traditional statistical pattern classiiers. It is shown that a Neural Network can be used to replace the vector quantizer (VQ) and some feature extraction and feature reduction modules in a discrete pattern recognition system. A criterion for training the NN-weights and the classiier jointly is derived, leading to the ...

متن کامل

ارائه روشی برای استخراج کلمات کلیدی و وزن‌دهی کلمات برای بهبود طبقه‌بندی متون فارسی

Due to ever-increasing information expansion and existing huge amount of unstructured documents, usage of keywords plays a very important role in information retrieval. Because of a manually-extraction of keywords faces various challenges, their automated extraction seems inevitable. In this research, it has been tried to use a thesaurus, (a structured word-net) to automatically extract them. A...

متن کامل

Modeling and Optimization of Anethole Ultrasound-Assisted Extraction from Fennel Seeds using Artificial Neural Network

Extraction of essential oils from medicinal plants has received researcher’s attention as it has a wide variety of applications in different industries. In this study, ultrasonic method has been used to facilitate the extraction of active ingredient anethole from fennel seeds. Effect of different parameters like extraction time (20, 40, and 60 min), power (80, 240, and 400 Watts) and solid part...

متن کامل

Connectionist Feature Extraction for Conventional Hmm Systems

Hidden Markov model speech recognition systems typically use Gaussian mixture models to estimate the distributions of decorrelated acoustic feature vectors that correspond to individual subword units. By contrast, hybrid connectionist-HMM systems use discriminatively-trained neural networks to estimate the probability distribution among subword units given the acoustic observations. In this wor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cs.NE/0206001  شماره 

صفحات  -

تاریخ انتشار 2002